Words containing , (comma):

Rank in Wordlist | Frequency | Word |
---|---|---|
11152 | 2 | ১,০০০ |
11157 | 2 | ১০০,০০০ |
11214 | 2 | ২,০০০ |
11236 | 2 | ৩০,০০০ |
11322 | 1 | 65,328,121 |
11377 | 1 | Board of Intermediate and Secondary Education, Jessore |
11865 | 1 | s,p,d,f |
11899 | 1 | £১৩,৭২৫,০০০ |
11900 | 1 | £৬০০,০০০ |
12132 | 1 | অধিগম,পর্যপ্তি,শ্রবণ,জিজ্ঞাসা |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
6896 | 3 | ১০০% |
6920 | 3 | ৩০% |
6925 | 3 | ৫০% |
6928 | 3 | ৬০% |
6933 | 3 | ৯০% |
11155 | 2 | ১০% |
11227 | 2 | ২৩% |
11230 | 2 | ২৪% |
11231 | 2 | ২৫% |
11252 | 2 | ৫% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
11532 | 1 | K&R |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
11724 | 1 | US$১০লক্ষ |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
11273 | 1 | ." |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
2631 | 7 | দু'টি |
5768 | 3 | ডিএনএ'র |
6915 | 3 | ১৯৮০'র |
6934 | 2 | .' |
7795 | 2 | কার্মা-ব্কা'-ব্র্গ্যুদ |
8676 | 2 | দু'টিতে |
8677 | 2 | দু'টো |
10088 | 2 | মি-লা-থোস-পা-দ্গা'র |
10188 | 2 | ম্ঙ্গা'-রিস |
11307 | 1 | 24°02'03 |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
13647 | 1 | আর+ভি |
13964 | 1 | ইউটিসি+০০:৩০ |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
8426 | 2 | টিসিপি/আইপি |
11320 | 1 | 60MR/MT |
11474 | 1 | GNU/NetBSD |
11475 | 1 | GNU/kFreeBSD |
11590 | 1 | Mozilla/ |
11662 | 1 | Read/Write |
11705 | 1 | TeleTYpe/TeleTYpewriter |
11997 | 1 | অগ্মেন্টিন/ক্লা |
13015 | 1 | আই/ও |
14000 | 1 | ইউরোপীয়ান/দক্ষিণ |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others are not. If words containing forbidden characters occur with more than a very low frequency, this may indicate a problem in preprocessing.
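The check described above can be sketched in Python. This is only an illustration, not the tool that produced these tables: the function name `flag_suspicious`, the `allowed` character set, and the `max_freq` threshold are assumptions made for the example, and suitable values depend on the language.

```python
# Characters examined in this subsection (taken from the list above).
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def flag_suspicious(word_freqs, allowed=frozenset(), max_freq=1):
    """Return (word, freq, offending_chars) for words that contain a
    special character not in `allowed` and occur more than `max_freq` times.

    `allowed` and `max_freq` are hypothetical, language-dependent settings.
    """
    forbidden = SPECIAL_CHARS - set(allowed)
    flagged = []
    for word, freq in word_freqs:
        hits = sorted(set(word) & forbidden)
        if hits and freq > max_freq:
            flagged.append((word, freq, "".join(hits)))
    # Most frequent first: these are the likeliest preprocessing problems.
    return sorted(flagged, key=lambda item: -item[1])


# A few (word, frequency) rows taken from the tables in this section.
sample = [("দু'টি", 7), ("১০০%", 3), ("K&R", 1), ("টিসিপি/আইপি", 2)]
for row in flag_suspicious(sample, allowed="'"):
    print(row)
```

Here the apostrophe is treated as allowed, so দু'টি is not reported, and K&R falls below the assumed frequency threshold.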
Example query for words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
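The same query shape can be run for each of the listed characters. The sketch below uses Python's `sqlite3` module with an in-memory database as a stand-in; the schema `words(w_id, word, freq)` is inferred from the query above, and the helper names `like_pattern` and `words_containing` as well as the `ORDER BY` clause are assumptions. Because `%` and `_` are `LIKE` wildcards, they have to be escaped. MySQL uses the backslash as its default escape character, whereas SQLite needs an explicit `ESCAPE` clause.

```python
import sqlite3

SPECIAL_CHARS = ",()%&$\"'+*=/_"


def like_pattern(char):
    """Build a LIKE pattern matching any word that contains `char`.

    % and _ are LIKE wildcards and the backslash is the escape character,
    so all three are prefixed with a backslash.
    """
    if char in "%_\\":
        char = "\\" + char
    return "%" + char + "%"


def words_containing(conn, char, limit=10):
    # Same shape as the query above. ORDER BY is added here (an assumption)
    # so that the rows come back in rank order.
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(char), limit)).fetchall()


# Stand-in database holding three rows from the tables (w_id = rank + 100).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [(6996, "১০০%", 3), (11632, "K&R", 1), (8526, "টিসিপি/আইপি", 2)],
)

for char in SPECIAL_CHARS:
    rows = words_containing(conn, char)
    if rows:
        print(repr(char), rows)
```

Without the escaping step, the pattern for `_` would match every non-empty word, since an unescaped `_` stands for any single character.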